Versions:

  • 1.0.0
  • 0.2.5
  • 0.2.4

Chough is a lightweight, open-source command-line automatic speech recognition utility published by hyperpuncher that delivers exceptionally fast transcription by coupling the highly optimized Parakeet TDT 0.6b V3 acoustic model with the efficient sherpa-onnx inference engine. Designed for developers, researchers, and production pipelines that need low-latency, memory-conscious speech-to-text conversion, the program processes audio in configurable chunks, keeping RAM usage minimal while still achieving near real-time throughput on CPU-only systems. Typical use cases range from subtitling long media files and indexing podcast archives to enabling live-captioning back-ends and batch-transcribing telephone recordings for analytics; because it streams audio through overlapping segments, Chough can handle hours-long content without loading entire waveforms into memory, making it suitable for both workstation and modest cloud instances. The current stable release, version 1.0.0, refines the default chunking parameters and improves punctuation recovery, while the three numbered versions maintained in the catalog let users pin deployments to earlier behaviors when reproducibility is critical. As a member of the Audio & Music transcription category, the tool accepts common formats such as WAV, FLAC, and MP3, emits plain or JSON output, and exposes command-line flags for beam width, hot-word boosting, and speaker diarization hooks, allowing straightforward integration into shell scripts, FFmpeg workflows, or larger Python services via subprocess calls. The software is available for free on get.nero.com, with downloads provided via trusted Windows package sources (e.g. winget), always delivering the latest version, and supporting batch installation of multiple applications.

Tags: